Linguistics 92 MULTIMODAL COMMUNICATION 2005 PROCEEDINGS from the Second Nordic Conference on Multimodal Communication
نویسندگان
چکیده
Speech and language science and technology evolved under the assumption that speech was a solely auditory event. However, a burgeoning record of research findings reveals that our perception and understanding are influenced by a speaker's face and accompanying gestures, as well as the actual sound of the speech. Perceivers expertly use these multiple sources of information to identify and interpret the language input. Given the value of face-to-face interaction, our persistent goal has been to develop, evaluate, and apply animated agents to produce realistic and accurate speech (Massaro, 1998). Baldi is an accurate three-dimensional animated talking head appropriately aligned with either synthesized or natural speech. Baldi has a realistic tongue and palate, which can be displayed by making his skin transparent. To implement multilingual agents, we have developed a client/server architecture system (Massaro et al., 2005; Ouni et al., 2005). The client is the application controlling Baldi. It sends text from a variety of languages including Arabic, Mandarin, and many European languages as well as English to a general speech synthesis server. The server generates the appropriate phonemes in the appropriate language with all the information needed by the client (phonemes, duration, pitches, word boundaries, etc.) and the acoustic speech waveform, and then it sends them back to the client. Using this information, the client generates the appropriate 1 Baldi is a registered trademark of Dominic W. Massaro.
منابع مشابه
Achieving Multimodal Cohesion during Intercultural Conversations
How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...
متن کاملThe Impact of Multimodal Channels on Teaching Idiomatic Expressions to Intermediate EFL Learners with Regard to Their Attitudes
This study was to explore facilitative function of using multimodal channels over single channel presentation and comprehension of idiomatic expressions to Iranian EFL intermediate proficiency learners. Out of a pool of 90, sixty intermediate participants were homogenized by a QPT test, using a quasi-experimental design. They were randomly assigned to three equal groups: WhatsApp-, SMS- and Cla...
متن کاملHuman Language Technology Conference of the North American Chapter of the Association of Computational Linguistics Proceedings of the Doctoral Consortium
Structural information in language is important for obtaining a better understanding of a human communication (e.g., sentence segmentation, speaker turns, and topic segmentation). Human communication involves a variety of multimodal behaviors that signal both propositional content and structure, e.g., gesture, gaze, and body posture. These non-verbal signals have tight temporal and semantic lin...
متن کاملPerception and Сontent Assessment of Active Users: Russian Language Social Networks
The paper considers studying the perception and assessment of media content in the Russian-language social networks, analyzing the causes that affect the perception and distribution of network content. The importance of language learning and communication in Russian-language social networks, and problems of content effectiveness is determined by the growth in the number and activity of Runet us...
متن کاملIntroduction to the special issue on multimodal corpora for modeling human multimodal behavior
There is an increasing interest in multimodal communication as suggested by several national and international projects (ISLE, HUMAINE, SIMILAR, CHIL, AMI, CALO, VACE, CALLAS), the attention devoted to the topic by well-known institutions and organizations (the National Institute of Standards and Technology, the Linguistic Data Consortium), and the success of conferences related to multimodal c...
متن کامل